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Abstract. A noncommutative Fornasini-Marchesini system (a multi-variable version of a linear 
system) can be realized within a weak Markov process (a model for quantum evolution). For a 
discrete time parameter this is worked out systematically as a theory of representations of structure 
maps of a system by a weak process. We introduce subprocesses and quotient processes which can 
be described naturally by a suitable category of weak processes. A corresponding notion of cascade 
for processes induces a represented cascade of systems. We study the control theoretic notion of 
observability which turns out to be particularly interesting in connection with a cascade structure. 
As an application we gain new insights into stationary Markov chains where observability for the 
system is closely related to asymptotic completeness in the scattering theory of the chain. This 
motivates a general definition of asymptotic completeness in the category of weak processes. 



1. Introduction 

Recently there has been much attention for a certain multi- variable version of linear system theory 
which presents evolution equations of the form 

x(ak) = Ak x(a) + Bku(a), 

y(a) = Cx(a)+Du(a). 

This is called a noncommutative Fornasini-Marchesini system in [!, 0, [|| . Here a € Ff where Ft is 
the free semigroup with d generators (which we denote 1, . . . , d and d € N or d = oo). The elements 
in F^ are words in the letters l,...,d, including the empty word 0. Composition is defined by 
concatenation of words, for example ak in the formula above is the concatenation of a word a and a 
generator k € {1, . . . , d}. Now u, x, y are functions on F^ taking values in vector spaces 14, X, y and 
A k : X -> X, B k : U -> X, C : X -> y, D : U -> y are linear operators. For d = 1 we have F£ ~ N 
and this is the classical setting of linear system theory (state space models). The functions u,x,y are 
interpreted as input, internal state, output of the system. 
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It has been established that many important mathematical concepts and results in linear system 
theory generalize nicely for all d, see [2[ . We mention two such concepts which will later be studied in 
this paper. The first is the observability map 

O c ,a -=(CA a ) aeF} 

which is a linear map from X into the y~ valued functions on F^ . (In this paper we use the convention 
that for any variables Xi, . . . , Xd and a word a — ol\ . . . a n G Ft we have X a 
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X a := X an . . . X ai , if a = we interpret it as an identity.) By studying the observability map we can 
find out what we are able to know about the internal space X by observing the output. 

The second concept we want to mention here is the transfer function which is a description of how 
inputs are transferred into outputs. In the multi-variable context above the transfer function can be 
defined as a formal power series 

oo 

T{z):=T(z 1 ,...,z d ):= ^ T a z a := D + C^zAf^zB = D + C(I X - zA)~ l zB 

c,£F+ r=i 

where z = (z\ Ix , ■ ■ ■ , Zd Ix) with indeterminates Z\, . . . ,Zd freely noncommuting among each other but 
commuting with the linear maps, A — (A\, . . . , Aj)*, B = (B\, . . . , BdY, and t denoting a transpose 
of the row vectors so A and B are column vectors of linear maps. In the classical case d = 1 we call 
the single variable z and then this reduces to a familiar formula which gives the transfer function as 
an analytic function. Explicitly 

( D if a = 

T a = l CB a i/H = l 

[ C A ar . . . A a2 B ai if a — a\ . . . a r , r = \a\ > 2 

We come back to this at the end of Section|3l See also [HHH for further discussions of similar formulas. 

Mathematical system theory is an abstraction from the physical dynamics. But of course its rele- 
vance to the real world depends on the fact that such a physical dynamics exists in the background. 
The basic idea behind our work is the proposal that the physical dynamics which implements a multi- 
variable system theory (such as the one sketched above) is described by processes describing quantum 
physics. In (2~3 | we started a project to investigate such connections, compare further [13] for a recent 
generalization of the model in 24( and also |25| for some complementary work. In these papers the 
quantum processes are based on infinite tensor products which are natural from the point of view of 
approximating the Fock spaces of continuous time processes but which sometimes obscure the more 
fundamental level where the connections with operator and system theory originate. In the present 
paper we go directly to this level by investigating theproposal that it is the concept of 'weak Markov 
processes', worked out by Bhat and Parthasarathy in [8, 9] to catch the most basic features of quantum 
Markov processes, which provides a systematic and very general point of departure for our project. 
The connections to operator theory are evident here because the concept of a weak Markov process 
may be seen as operator theoretic dilation theory studied from a probabilistic point of view. Dilation 
theory will be mentioned only in side remarks in this paper however, the emphasis lies on the structure 
of the full process. The benefits of such studies go in both directions: access to operator and system 
theory tools for concrete quantum models and guidance for the development of the general theory from 
the questions arising in the models. To achieve such an interaction between the disciplines a more 
systematic development of the theory is needed. The following sketch of the contents of this paper 
should be read with this motivation in mind. 

In Section[2]we start with a self-contained but concise description of the basic theory of weak Markov 
processes in discrete time: The dynamics is described by a row isometry and then from a co-invariant 
subspace certain structures emerge which can be given a quantum probabilistic interpretation, such 
as transition operators and weak nitrations. Most of this is well known but note how in the end we 
give an operational meaning to the elements of the free semigroup F£ by interpreting them as certain 
measurement protocols. 

In Section [3] we define the notion of a representation of structure maps A,B,C,D (as described 
above) by a weak Markov process and in this way we get an explicit systematic procedure to identify 
multi-variable linear systems (as described above) within quantum physical models. At this point it 
remains quite abstract. As a preparation for seeing representations of structure maps in use we first 
introduce in Section |4] the theory of subprocesses and quotient processes of (discrete weak Markov) 
processes and then show that within a suitably defined category of processes this can be reformulated 
as a short exact sequence. The main result here is a classification of extensions appearing in such short 
exact sequences by a construction which we call a 7-cascade of processes. There is a set of contractions 
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from which 7 can be chosen which gives a parametrization of all the ways in which two processes can 
be put together as subprocess and quotient process, with 7 = yielding the direct sum. 

Cascades are a well known concept for systems and their structure maps. The terminology of a 
7-cascade for processes is justified in Section [5] where we show that within a 7-cascade of processes we 
have a representation of a cascade of the original systems and that in fact this construction is a natural 
tool to represent cascades of systems and of structure maps. Let us remark here that the notion of 
cascades and of more general networks of processes has been around for some time in a continuous time 
setting and this theory has been investigated intensely because of promising applications in quantum 
filtering and quantum control 26|, |27| . There is no explicit comparison with continuous time systems 
in this paper, however we provide the basis for such a comparison by describing weak Markov processes 
in terms of product systems which suggests how to build the theory starting from continuous product 
systems. Our justification for concentrating on the discrete time setting here is the same as in Helton's 
seminal paper (29j in which he connects classical system theory (d = 1) with operator models and 
scattering theory, saying: 'We concentrate on discrete time systems because it is for these that the 
relationship is most clear' ( 29], p. 15). 

In fact, very similarly to the path followed by Helton in [29] it is quite natural in our setting to 
connect observability of represented multi- variable linear systems with scattering in quantum physical 
models. We investigate observability in Section [5] and show that in a quantum model it can be given 
the same interpretation as in classical system theory, namely that by observing the outputs we have an 
indirect way to measure and to investigate those parts of the internal space which have the character 
of a black box. We show that we get a particularly interesting situation if we have a 7-cascade and 
if we use the input space of the subprocess as the output space of the cascade, so that looking for 
observability amounts to the question how much we can find out about the cascade by observations 
which only involve the subprocess. The extreme case when we can find out everything about the 
cascade by such special observations we call asymptotically complete. 

We give a convenient criterion for asymptotic completeness in terms of asymptotic properties of the 
transition operator of the process which is a completely positive map. This criterion is a generalization 



of a criterion for asymptotic completeness |21|, [22|, |20] in a scattering theory for operator algebraic 
stationary quantum Markov processes introduced by Kummerer and Maassen in [30]. It turns out 
that our definition of asymptotic completeness for short exact sequences of weak processes can be 
seen as a generalization of the definition of Kummerer and Maassen. Such a connection has been 
hinted at in (HI but the theory developed in this paper allows us to explain the relevant facts in a 
much more conceptual and elegant way. The details are in Section [7] We include the general fact 
that normal invariant states lead to weak subprocesses and we work out how for a stationary Markov 
chain in an operator algebraic setting we can construct an associated weak process together with a 
subprocess (with one-dimensional internal space) and hence we can establish the connection to our 
previous results. This improved conceptual understanding of asymptotic completeness is an excellent 
example of the cross-fertilization between operator and system theory on the one hand and quantum 
probability on the other hand which we have in mind. It uses only a very special case of our general 
theory and suggests further work. 



2. Weak Markov Processes 

Unitary dynamics on a Hilbert space is the most basic way of describing quantum mechanical 
evolution. If causality is taken into account and one restricts the attention to observables belonging 
only to the future (or only to the past) then it becomes natural to study *-endomorphisms of B(H), 
the algebra of bounded linear operators on a Hilbert space %. A convincing argument in this direction 
is presented in [l[, Section 1.2. From now on let : B{%) — » B{T-L) be a *-endomorphism and % a 
separable Hilbert space. Then it follows from the representation theory of B(H) that there exists a 
separable Hilbert space V and an isometry V : H <£> V -> H such that for all X 6 B(H) 



9{X) = V X®t r V* 
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We assume 9 ^ 0, then 9 is automatically injective. See 32| for more details. If (e fe )fc=i ( with d e N 
or d = oo) is an orthonormal basis of "P then with 

V k :=V\ n9eh eB(H), k = l,...,d, 

we can also write 

d 

9(X)=Y,V k XV k * 
fe=i 

(limits to be understood in the strong operator topology if d = oo) which is called a Kraus decomposi- 
tion. The 14 are isometries with orthogonal ranges, so alternatively (and with the same notation) we 
also think of V as a row isometry 

d 

V = (Vx,V 2 ,...,V d ) : 0ft->ft. 

l 

V is called a row unitary if J2k=i ^fe^fc* = ^ an d this is equivalent to 9 being unital, i.e., 9(1) = 1. If 
this additional assumption is used in the following then it will always be explicitly stated. 

Using induction we define row isometries yw : H ® V — >• % (where n G N) by := V and, for 
| G H ® OT 1 P and 77 G 7> 

y(")(|® ?? )=^(y (n - 1) |'(8)ry). 

Later we need the following equivalent description for the adjoints: If £ G "H and F*£ =: ® rj^ G 

then 

n — 1 n 

y (n) *£ = J] ^""^(GO ®r; fe e(H®(g)P)®7> = H®(g)7>. 
fe i i 

It is not difficult to check that this implements the n-th power of 9: 

9 n (x) = v {n] x ® i^np v {n) * = y a xv* . 

aGF^ ,\a\— n 

Here the notation is V* := (V a )* = V* . . . V^ 1 if a = a\ . . . a n and \a\ — n is the length of the word. 
This Kraus decomposition where a sum over all words of length n occurs gives the first connection to 
the multi- variable formalism sketched in Section [TJ 

Remark: The tensor products ( (^)™ 'P) neN appearing in these formulas represent the (discrete) product 
system associated to the endomorphism 9 and though we do not go into continuous time systems in 
this paper it is worth noting that the natural starting point to translate our results to continuous time 
would be to consider continuous product systems and in this way make the connection with the theory 
exposed in [l, 11, 2j|. 



To introduce processes which resemble Markov processes from probability theory we need to specify 
a subspace t) C T-L (by which we always mean a closed subspace if not otherwise stated) . Let us denote 
the orthogonal projection onto f) by p = P^. (In this paper we use consistently the notation Pc for 
the orthogonal projection onto a subspace C.) Given IjcWwe have (with n G No) a family of normal 
*-homomorphisms 

J {n) : E{\)) -> B{H) 
x h> 9 n (xp) 
and the compressions Z n : fi(fj) — » B(f)) defined by 

pJ^(x)p=:Z n (x)p [=jW(Z n (x))] 

which are contractive completely positive maps. For the processes to be defined below the j( n ' play 
the role of noncommutative random variables and the Z n are transition operators. 

The subspace f) of H is called invariant if Vfef) C f) for all k = 1, . . . d and co-invariant if V fe *f) C I) 
for all k — l,...d. The importance of co-invariant subspaces in this context has been observed by 
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many, see for example [14( for various related topics. We note some useful properties equivalent to 
co-invariance. 

Lemma 2.1. The following are equivalent: 

(1) e(p) P = e(t)p 

(2) rj _L ®V) 

(3) f) is co-invariant 

Let us write Z for Z\ and state the following modification of the previous lemma: 

Lemma 2.2. The following are equivalent: 
(V) p < 9(p) 
(?) f) c V(t)®V) 
(3') p < 0(1) and f) is co-invariant 
(4') Z(l 6 ) - 1„ 

Below we prove Lemma \2. II It is then easy to get Lemma \2. 21 bv checking that (£') is nothing but 
(£) together with p < 9(1) (for £ = 1, . . . , 3). The equivalence of (1') and (4') is immediate because, 
by definition, Z(l^) = p9(p)\t ) . 

If V is a row unitary then all the properties in Lemmas 12.11 and 12.21 are equivalent; for example 
(3) and (3') are equivalent for a row unitary because in this case we have 9(1) = 1. This means that 
for a row unitary we can always use the simpler statements (l')-(4') when dealing with co-invariant 
subspaces. 

Proof. Consider an orthogonal projection q onto qH. Then 6(q) = V(q ® l-p)V* is the orthogonal 
projection onto V(q% ® V). Hence 9(1 — p) projects onto V^ 1 - ® V). This gives (1) (2). 

Applying V* to (2) we obtain V*t) _L f)- 1 ® V, hence V*\) C f) ® V and V£t) C f) for all k = 1, . . . d 
which is (3). Conversely, from (3) we get 

d d 

9(p)p = ]T V kP V h *p = V k V h *p = 9(l)p 
i i 

which is (1). □ 

The following definition is consistent with the terminology used by Bhat and Parthasarathy in 
[H, Q where also continuous time and versions with C*-subalgebras of B(t)) are considered which 
allows the inclusion of classical Markov processes by restricting to commutative subalgebras. We only 
consider discrete time steps and focus on the algebra 23(f)) of all bounded operators. This allows a lean 
formulation from which further (probabilistic) features of the processes can be pulled out afterwards. 

Definition 2.3. A discrete weak Markov process is a triple (%, V, f)) where V is a row isometry on 
a Hilbert space T-L which contains the Hilbert space f) as a co-invariant subspace with respect to V . 
Further we assume that 

H :=span{V a \): a G F+} . 
If V is a row unitary then we call the process unital. 

For simplicity we refer to discrete weak Markov processes as processes. Note that if V is originally 
defined on a larger Hilbert space which contains the Hilbert space f) as a co-invariant subspace with 
respect to V then we can always restrict to a space % satisfying the additional assumption which 
ensures that T-L is the smallest invariant subspace containing f). Because f) is co- invariant it follows 
that T-L is even reducing in the original larger Hilbert space. 

Remark: From the point of view of dilation theory V is the minimal isometric dilation of its 
compression to f) and as such it is determined by it up to unitary equivalence, see [33j]. 

Let us now investigate the following increasing sequence of subspaces which is called the weak 
filtration associated to the process: 

q k := 9 k (p), p k := sup(q ,...,q k ), t) k := p k U 
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(for k e N ). Then p = p Q < p 1 < p 2 < . . . and f) = h C f)i C h 2 C . . . 

Note that if the process is unital then q^ = p^ for all k and the arguments simplify. For unital 
processes we have h„ = V'")(f) 53 0"'P) and the inclusion h„ c t), i+1 is mapped by to an 

inclusion I) (g) ®":P C f) ® ®" +1 "P. Hence in this case we have an identification of % with the inductive 
limit of the sequence (f) <8> ®"'P) . 

Proposition 2.4. Let (W, V, t)) be a process. Then 

(a) y(h m ® p) c h m+1 (for all m G N ) 

(b) Z„ = (for all n 6 N) 

(c) If < m < n then p m VW = g m 7W. 

(d) If < m < n then 

p m J {n \x)prn = J im) (Z n - m (x)) 

(a) states an adaptedness property for the process and the filtration while (b),(c) and (d) resemble 
properties of Markov processes in classical probability and hence motivate the terminology 'weak 
Markov process'. Versions of (d) appear in Bhat's papers as the 'weak Markov property'. 

Proof. 9(p m ) is the projection onto V {§ m ®V) ■ Hence (a) is nothing but a reformulation of the obvious 
9(j>m) < Pm+i- We prove (b) by induction, n = 1 is the definition of Z. Now suppose that for n > 2 

Z n _i(a:) =pV( n -Vx®lV t > n - 1 >\ t> = Z n -\x). 

Then we can use (2) of Lemma |2~T1 in the form pV = pV(p <£> 1) to get 

Z n [x) = pV {n) x® l^ (n) *|„ 

= P V[(V {n - 1] ® t)((x ® 1) ® t^V^-V* ® l)]V*\f, 

= pV(p®l)[...](p®l)V*| 6 

= P y(Z"- 1 (a;) ® 1)V*| 6 = Z n (a:) 



To prove (c) we note that (2) of Lemma [2~T1 in the form V(f) <8> "P) _L f) can be iterated to yield 

v w {h ± »'P 9t ) if). 

for all !e N. Hence for all £ < m we find, by applying V( m ~*\ 

which implies p m l /(m, (P 1 ® 1) = 0. 

Together with q m = m (p) = V {m ~> p ®tV^ m > we obtain 

v m v {m) = P^ (ra) p ® i = Pm? m v {m) = q m v {m) . 

The iterative definition of the y( m ) shows that their ranges do not increase if m increases. Hence also 
PmV^ = q m V^ whenever m < n. 

To get (d) we start from the definition of Z„_ m , 

pj^- m \x)p = Z n _ m (x)p, (m<n), 

and apply 9 m to get 

q m J {n) {x)q m = d m (Z n _ m (x)p) = J^{Z n _ m (x)) 
Now because of (b) and (c) we can replace Z n - m by Z n ~ m and q m by p m . □ 

The following observation, also noted in .10]. is crucial for our approach. Bhat remarks in (lo| . 
p. 562, in this context: 'Roughly speaking there is also an additive structure when we deal with general 
quantum dynamical semigroups . . . A detailed study of such systems is yet to be undertaken.' The 
work presented in this paper goes into this direction. 



WEAK MARKOV PROCESSES AS LINEAR SYSTEMS 



7 



Proposition 2.5. Let (T~L, V, t}) be a process. Then 

S : = fa 9 fto = span(f), y(ft ® P)) e ft 
is a wandering subspace, i.e., V^f _L Vp£ if a 7^ j3, and 

w = ft e v a £ , fi„ = ft e v a s 

Proof. To prove that £ is wandering it is enough to show that £ _L V a £ for all a^O. Suppose a^O. 
Because £ _L ft we conclude by (2) of Lemma T2.1l that V/j£ _L f) for all /3 € -Fj~. In particular V^f _L f). 
Writing a = kf3 with fc G {1, . . . , d} and /3 € F+ we hnd V Q f _L V k t). Finally if fc' G {1, . . . , d} but 
k' ^ k then V Q £ _L 14' "H because T4 and 14' have orthogonal ranges. Putting it all together we have 
£ _L V a £. The other assertions are now immediate. □ 

The following results indicate how the V a are related to the quantum physical behaviour of the 
process. For this we apply the standard interpretations of quantum physics to the mathematical 
objects. Suppose X : % — > % and Y : V — > V are linear operators. Then for n € No and m € N we 
have the following linear operators on % 

n 

x n := y {n > x® (g)i P y(">* , 
1 

m — 1 



F m : = y(") l^g (gjlpOF V (m) * 



(empty tensor products to be omitted for n = and m = 1). We refer to these operators as observables. 

Proposition 2.6. Suppose that the process (H, V, f)) is unital. Then the observables Y m commute 
with each other and Y m commutes with X n whenever m < n. 

Suppose further that Yej — j €j for the orthonormal basis (ej )^ =1 of V which we use to define the 
isometries Vj. If the process is prepared in a vector state given by a unit vector 77 in the range of 
V a so that the m-th letter in a is equal to k then the measurement of Y m yields the outcome k with 
certainty. 

Proof. For 1 < m < n we define 

m—l n 

Y m , n := yw i H ® (g) \ v ® y ® (g) \ v y (n) * 

1 m+1 

(empty tensor products omitted in the cases m—l and m = n). Because the process is unital, V is 
a row unitary and in particular V 1% <8 lp V* = 1%. It follows that Y m = Y m n whenever m < n. 
Writing Y m in this way it becomes obvious that it commutes with X n and Y n . 

Let P ek be the orthogonal projection onto the one-dimensional subspace Ce^ of V. Then 

m—l m — l 

(/?, y< m ) i w ® (g) %> ® p £fc y( m )*r;) = (y( m )*ry, t H ® (g) i?> ® p et y (m N) 

1 1 
is the probability that a measurement of F m yields the result k, according to the standard rules of 
quantum mechanics. Now suppose that the process is prepared in a vector state with a unit vector rj 
in the range of V a , i.e., 77 = V a £, for some £ € H, so that the m-th letter in a is equal to fc. Then from 

y (m N= E We®*/* 

|/3|=m 

(with e/3 for f3 — Pi . . . f3 m is short for ® . . . € (S)" 1 ^ 3 ) we see that only the terms in the sum 
with /3 m = fc can be non-zero and hence the application of P Cfc at the m-th copy of V always acts 
identical on all the non-zero terms. So the probability above is the squared length of a unit vector, 
i.e., it is equal to 1. □ 
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This result gives an operational meaning to the words a £ Fj~ by identifying them with measurement 
protocols for certain observables. We can think of measuring Y\, . . . ,Y n as performing a certain type 
Y of measurement at the consecutive times l,...,n and the commutation properties proved above 
ensure that these measurements can be performed without perturbing the system (non-demolition 
measurements). For example with x £ B(t)) it makes sense to consider J^ n \x) — X n for X = xp^ 
conditioned on Y\ , . . . , Y n (conditioning in the sense of classical probability theory) . Such schemes and 
their application to quantum filtering and quantum control have been pioneered by Belavkin 0, 0] 
and this is a rapidly developing interdisciplinary field of study. We mention the recent introductory 
surveys 12, 13| which give many references, the latter focusing on discretized models and containing 
constructions similar to the one above ([13j, Section 2.5). It is of course interesting to prepare the 
system in other states where the outcomes are not deterministic but only statistical information can 
be obtained, see for example pjj which uses discretization of continuous processes and thus can be 
directly connected with the approach here. For a unital process the Hilbert space H is always the 
orthogonal sum of the ranges of the V a for all a with a given length, so in principle the problem can 
be dealt with for an arbitrary vector state by decomposing the state vector with respect to such an 
orthogonal sum and then using Proposition 12.61 This may not always be the most practical path to 
follow for data given in a different way but it should be of theoretical significance. In this place we 
only want to indicate the relevance of the V a for applications and so we don't go into further details 
here. 

3. Representations of Structure Maps 

If for the multi- variable systems introduced in Section [1] the spaces are Hilbert spaces and the linear 
maps are contractions between these Hilberts apaces then we want to think of them as appearing inside 
the weak Markov processes introduced in Section [2l The following definitions give a precise meaning 
to that. Then we justify the definitions by discussing how the represented structure maps can help us 
to understand the properties of the process. 

Definition 3.1. A representation of an input pair (A,B) (with column contractions A : X — > Q)f X 
and B : U — » (J)^ X) is given by a process (V., V, f)) such that X = t) and A = V*\\^ together with an 
isometry i :U — > £ such that, with U := io(U) the represented input space, we have B — V*\u io- 

A representation of an output pair (A, C) (with column contractions A : X — > X and C : 
X — > y) is given by a process (H,V,t)) such that X = f) and A = V*\^ together with an isometry 
jo : y — > f)i = rj © £ such that the represented output space J^o ■— io{y) is wandering for V and we 
have C = j^Py \t,. 

A representation of the structure maps (A, B, C, D), where D : hi — > y, is given by representations 
of (A, B) as an input pair and (A, C) as an output pair as above with the same process {%, V, f)) such 
that D = joPy a \u io- 

In the following we shall often suppress the isometries i$ and jo and treat them as identifications 
whenever this simplifies the notation. Note that a represented input space Uq is always wandering 
because by definition it is a subspace of the wandering subspace £ , see Proposition [231 An interesting 
example for a represented output space is 

£* := i)% 9 V(t) ®V)=HQ V(H ®V) = ker V* 

which is the wandering subspace arising from the Wold decomposition of the row isometry V, see |33j |. 

Let us write Up for VpUo and y a for V a yo, further we denote by U+ the closed linear span of all 
Ma, by y+ the closed linear span of all y a (with a £ F^ ). With suitable identifications we can think 
of U+ as an orthogonal direct sum of copies of the input space U, similarly y + is an orthogonal direct 
sum of copies of the output space y. We sometimes refer to these copies as a-translated input resp. 
output spaces. 

Representations of structure maps as above are always causal in the sense that y$ _L VpUo — Up for 
all (i £ F^ with |/?| > 1. More general it follows further that y a _L Up whenever \a\ < \/3\ which we 
also refer to as causality. 
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Note that from a representation with A = (A%, . . . , A^Y we get a Kraus decomposition for the 
transition operator of the process, namely 

d 

Z(x) = A*xA = A* k xA k , 

k=l 

(limit in the strong operator topology if d = oo). 

Given a process (H, V, f}) any subspace of £ interpreted as a represented input space gives rise to 
a representation of an input pair (A,B) and any wandering subspace of f)i interpreted as a repre- 
sented output space gives rise to a representation of an output pair (A, C) and together we have a 
representation of structure maps (^4, B, C, D). 

Proposition 3.2. Given a representation of structure maps (A,B,C,D) by a process (H,V,t)) let 
£ — £ © (B V a r) a be an element of H, with £ € f) and r\ a &Uq. If (for all words a) 

x(0) := £, x(a) := P h V:i 

u(a) := r) a ( => V a u(a) = P u ^ ) 

then we have (for all words a and generators k = 1, . . . , d) 

x(ak) — Akx(a) + Bku{a). 

If further y(a) := Py V*£, ( V a y(a) = Py a £ ) then we have 

y{a) = Cx{a) + Du(a). 

Hence we have a noncommutative Fornasini-Marchesini system (compare Section [T]). 

Proof. The first assertion follows from 

x(ak) = == P„T4*V*f = P^P^V^i 

= V^P^ + V k * Va = A k x{a) + B k u(a). 

Using causality we find that 

y{a) = Py V:£ = Py P t ,®u V:£ 

= Py [x(a) + u(a)] — Cx(a) + Du{a). 

□ 

Note that the proof uses causality but does not use the fact that J^o is wandering. Hence the result 
is valid for generalized (non-wandering but causal) output spaces. 
With the maximal choice 1Aq — £ we have for any vector £ € T-L that 

Pv a u I = Pv a t,i® Pvm, I = V a x(a) 8 V aP u{ap) , 

which is relevant information with respect to Proposition ^. 61 above. 

Let us now work out the observability map Oc,a and the transfer function T (already introduced 
in Section [T]) for represented structure maps. If we identify y a ~ Q y then 

Pyc.\*=V a Py VZ\ h 2i a CA a 

in other words, we obtain the a-entry CA a of the observability map Oc,a by projecting the represention 
t) of the internal space X to the a-translated output space y a . 
Similarly with y a ~ a y and Uq ~ U we obtain 

C D if a = 

Py a k = V a Py V: \ Uq — — ck \ CB a if \a\ = 1 

[ C A ar . . . A a2 B ai if a = ot\ . . . a r , r = \a\ > 2 

In other words, we obtain the a-coefficient T a in the formal power series expansion of the transfer 
function T (already introduced in Section [T|) by projecting the represention Uq of the input space U 
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to the a-translated output space y a . Causality implies that further, with identifications y a ~ Q y and 
Up ~£ U, 

PI „ / T a if a = (3a 
*y a \u p -a,p I otherwise 

This pattern in the operator- valued kernel {Py a \up) a + describes what is called a multi-analytic 
kernel. It is an operator-valued matrix describing the contraction Py, \u + for the orthogonal decompo- 
sitions with respect to the translated output resp. input spaces. It follows from this description that 
this contraction intertwines the row shifts on Q)plAj3 and @ Q y a which are obtained by restricting 
the row isometry V. This is the defining property of a mult i- analytic operator and we have verified 
it for the contraction Py, \u + . Multi-analytic operators have been introduced and studied by Popescu 



34 . |36| , the situation above involving transfer functions for pairs of wandering subspaces is worked 
out in more detail in [25| . 

We can think of it as a kind of multiplication by the transfer function T . In fact for d = 1 it is 
literally multiplication by an ordinary analytic function. But for all <i > 1, for a noncommutative 
Fornasini-Marchesini system, we can still verify the following analogy: If we introduce formal power 
series for the entries y{a) and u{0) (see Proposition 



u{z) = u(zi, ...,Zd) = y~ K ^ u((3)z 
P 

y( z ) = y( z %, ■ ■ ■ , z d) = ^2 y( a ) 2: " 



then, with the initial condition x(0) = 0, we have the input-output relation 

y(z) = T{z)u{z). 

Recall the convention z a = z ar . . . z ai for a = a% . . . a r which gives the multiplication rule z a z^ = z^° '. 

4. Categories of Processes and 7-Cascades 

We introduce additional concepts with the intention to describe substructures of processes. We say 
that (Q, V s , q) is a subprocess of the process (%, V, f)) if g is a closed subspace of t) which is co-invariant 
for V and V s = V\g where Q = span{V a g : a £ F^}- Note that g is also co-invariant for V G and 
(G, V s , g) is a process in its own right. 

Given a subprocess (G, V s , g) of a process (H, V, f)) we can form the quotient process 

(H,V,t>)/(g,V s ,g) := (IC,V K ,t) 

where 6 := f) 9 g, K, := span{V a t: a e Fj~}, V k := V\/c- Let us check that (/C, V K ,Vj is a process. 
Indeed, because g is co-invariant for V we see that K. is contained in HQg. Hence K9t is contained in 
HQl) which is a l/-invariant subspace orthogonal to t. Hence £ is co-invariant for V K which proves our 
claim. Note that in general t is not co-invariant for V. Considering adjoints we see that V Q * — V*\g 
but only V K * — (Ptc®l-p)V*\ic- So we need to distinguish carefully between subprocesses and quotient 
processes. 

It is convenient to reformulate these concepts within a category of processes which we define now. 
The objects of the category are the processes with a common multiplicity space V . A morphism from 
(1Z, V n , r) to (S, V s , s) is a contraction t : r — > s which intertwines the adjoints of the row isometries, 
i.e. 

V s *t = (t(g) l v )V n *\ x . 

(or written differently: Af t = tAf for k = 1, . . . , d and Af = Vg*\ s , Af = Vf*\ t ). Composition 
of morphisms is given by composition of operators, the identity morphism is given by the identity 
operator. 

We add some immediate observations justifying this definition. First note that a morphism t is 
an isomorphism if and only if t is unitary and we get a reasonable meaning for two processes to be 
isomorphic. In the following we often identify isomorphic processes. 
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Also note that if we have a morphism given by a contraction t : t — > s then t can always be extended 
to a contraction T : 1Z — > S such that T\ t — t and ||T|| = and 

V S *T = (T® t v )V n * , 

this is nothing but the commutant lifting theorem in the version of Popescu [35| . We call T an extended 
morphism associated to t. We can also think of a morphism as the class of all extended morphisms 
associated to t, this attaches a global interpretation to it. 

Finally it follows from V s * t = (t ® l v )V n % that t(t) is co- invariant for V s , and hence from a 
morphism t from (TZ, V n ,x) to (S, V by defining g to be the closure of t(x) and V s := V \g, we 
always obtain a subprocess (Q, V G , g) of (5, V s , s). 

In fact, we can reformulate subprocesses and quotient processes in terms of morphisms as follows. 
If (Q, V s , Q) is a subprocess of a process (H, V, f)) then 1 , interpreted as an embedding of g into f), is 
an isometric morphism: 

V*l B = (l g ®l v )V s *\ g 

Conversely, given an isometric morphism t from a process (Q,V s ,g) to a process V, f)) we can 
interpret t as an embedding 1 and the properties of a morphism ensure that g is co-invariant for V. 
With an extended morphism tg, interpreted as an embedding of Q into "H, we can also arrange that 
V s = V\g (by uniqueness of a minimal isometric dilation up to unitary equivalence). So we get a 
subprocess. 

For the corresponding quotient process (/C, V , t) we can check that the orthogonal projection 
Pj : rj — >• J is a coisometric morphism: 

F^Pt = (ft <8 1t>)V*| 6 

Conversely, if we have a coisometric morphism t from V, f)) to a process (K^V^jt) then we can 
interpret t as an orthogonal projection Pt : f) — > {, then use the properties of a morphism to check that 
g := f) f is co-invariant for V , giving rise to a subprocess so that the corresponding quotient process 
is the process (K, 1 V K: 1 i) we started from. Note that P s and 1{ are not morphisms, in general. 

The image of 1 B is equal to the kernel of P{ and we can proceed to give a concise description of the 
situation as a short exact sequence of processes: 

- (Q, Va,g) (H, V, f)) (K, v K , e) > 

Here stands for the process on the O-dimensional space which takes the role of a zero object in our 
category. In the following we suppress the embeddings in the notation whenever this is convenient. 

Lemma 4.1. Given a short exact sequence of processes 

(g, V s , g) (H, V, f)) — (/C, V K , t) 

then for the relative position of the wandering subspaces 

£ 3 := spaniQ, V g (g ® P)) G g 
Si := span{t, V K {t <g> P)) G F*^ <8 P) 

we obtain 

More general, for all a G F^ 

PqWcEI = ft^f |y Q £, e = K*Pe<> |ft V£|v a £« 
ftc|v a £* = -fVcg* |v„£» = V a Pei\s»V*\v a sn 

Proof. We have e * C f) ffi e where £ := span(f), V(f) ® V)) © f). Because F*f)©£ = f)®Pwe conclude 
that V*£* C f) ® P. But by the definition of £* we also have 

(flc ® lp)F*£* = V*^* = {0} 
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and it follows that V*£\ Cg®P. Hence if £ e £ fl , 77 e 5* and a 7^ then V^r? C g and 

and it follows that £\ _L V a £ B for all a 7^ 0. Obviously also £ ' _L g and so Pg$ \ £ t — Pg\st which is the 
first formula we intended to prove. 

It is clear that (Pgs |ge) = Pgt |gj . To get the equality P/c If = Pe 1 If 8 we argue as follows. Consider 
£ { := span(t, V(t © T 3 )) G t. Because £ f _L f) it follows that £ t C £ and because 5 is wandering for V 
wc conclude that V a £ 1 _L f) © £ if a 7^ and hence 

flc((je5) = «©£'. 

Now, because £ B C f) © £, we can compute 

Pclfo = PicPt,®£ If = -Ptef^f" = -Pfjlf" • 
For the last equality above note that 5| = (6 ffi£ { ) © V(t but £*> C span{g, V(q<E)V)} _L V(«®7>). 

Because £ 9 and £* are wandering it is now immediate that 

V a £ B ±Vp£t if a 

To get the first line of the general formulas we only have to note additionally that always Vp£\ _L g. 
To get the second line we have to prove that additionally 

V a £ s _L JCQ V p £l=:JC res (residual part). 

Indeed, if a = this follows from Pk.\e<> = P£t\ss> shown above. Then for other a reduce it to 
the previous case by using the fact that V k \k:„ s is a row unitary and hence V*JC res = V K *JC res C 
JC res <g> V. □ 

These observations suggest the following construction which allows us to classify the processes which 
can be obtained by short exact sequences, given the subprocess and the quotient process. 

Definition 4.2. Given processes (Q, V s , g) and (JC, V K , t) (with a common multiplicity space V) and 
any contraction 7 : £\ — > £ B we define the 7-cascade 

(G,V g , Q ) < 7 (JC,V K ,t) := («,V,f)) 

where 

H := 0©/C© (D r )a- 



V := 



(1„ © ( J*. ) ) ^ on fl 

T/ K on/C 
canonical row shift on ® ag ^+ (X> 7 * 



We add the following explanations for this definition. D 7 . := VI — 77* is the defect operator for 
7* and T> 7 *, the closure of its range, is the defect space. Then Q(EF + (T> 7 *) a is the orthogonal sum 

of a family of copies of T> 7 * indexed by the free semigroup , so what we mean by the canonical row 
shift on this space is just moving elements between these copies. Further, to explain the action of V 
on q, note that V s maps g into g © £ B and now 1 B acts as identity on g, the contraction 7* maps £ 3 
into £ I C JC and D 7 . maps £ B into 2? 7 » which we interpret as (2? 7 *)o C ® ae p+(J) 

Note that for 7 = the 7-cascade is nothing but the direct sum of the two processes (Q, V G , g) and 
(JC,V*t). 
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Theorem 4.3. Given processes (Q,V s ,q) and (K,,V K ,i) (with a common multiplicity space V) and 
a contraction 7 : £* — > £ s . Then the 7-cascade 

(0,V o ,fl) < 7 (AC,V^,«) 

is a process, 

7 = ffe»U,j = Pg\e* 
7* = i'fiilf. =Pk\s* 

and we have a short exact sequence 

-> (a, a) ^ (e, g) < 7 (AC, «) A (ac, v* t) -> . 

In other words, (Q, V s , g) is a subprocess and (AC, V K , 6) is a quotient process of the 7-cascade. 
Conversely, in a short exact sequence 

(G, V s , Q ) {%', V, f)') — (AC, V* «) — - , 

where 1^ resp. P' t are an isometric resp. a coisometric morphism, the process (W, V, f)') is isomorphic 
to a 7-cascade for a contraction 7 : — > £ 3 such that moreover the corresponding extensions are 
equivalent, i.e. the following diagram commutes: 

(G, V s , g) - (H', V, b') (AC, V K , t) 

j isom. 

(G, V s , q) — (G, V 3 , 0) < 7 (AC, V* I) (AC, V* {) 

Two cascades with 71, 72 : — » 5 s arc equivalent as extensions if and only if 71 and 72 are equal. 

We can summarize the theorem by saying that there is a one-to-one correspondence between equiv- 
alence classes of extensions of the process (AC, V^t) by the process (G,V^,g) and contractions from 
£ * to £ B and this correspondence is given by the 7-cascade construction. 

Proof. Working within the 7-cascade as defined above we have £ identified via the isometry f ^ 

with a subspace of £\ (£V)p. Note that £\ is the wandering subspace arising from the Wold 
decomposition of V^, see [33|, and (2? 7 »)o is another wandering subspace with all translates orthogonal 
to the translates of £\. Hence the embedded £ 3 is wandering for V. With this it is now easy to check 
that V is a row isometry, that (H, V, f)) is a process with subprocess (Q, V s , 0) and quotient process 
(AC, V , t) and that 7 and 7* satisfy the formulas stated. 

Now suppose that the process (%', V, f)') is given by a short exact sequence, i.e. as an extension of 
(AC, V , t) by (0, V" e , 0). We define 7 := Pgo \ £ t : £\ — > £ B and then form the corresponding 7-cascade 
(0, 0) < 7 (AC, V , {) = (%, V, i)). We verify that this yields an equivalent extension by constructing 
a unitary J* : H' — > H which intertwines the row isometries V and V and which maps l fl £ to l g £ if 
£ € and (Pjc)*r] to (P/c)*r/ if 77 e AC. In fact, then the adjoint J : "H — > W is an extended morphism 
and its restriction j : () — > f}' is the isomorphism we look for. 

To see that J* exists it is enough to check that the remaining parts can be matched correctly. We 
invoke the following lemma which is a standard tool in operator theory. 

Lemma 4.4. Let £ and C be Hubert spaces and £0 a closed subspace of C. If w : C — > C is an 
isometry such that C = span{Co, w£'} then w is unitarily equivalent to 



P 

D n 



CJ — > £q T> p where p — Pc w . 
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The unitary from C to Cq © T> p needed in the lemma is the identity on Cq and it is w£ i— > 



Pi 

D P i 



for £ G £' . We apply Lemma T4.4I with £' — £ e , with w being the isometric embedding of £ 3 into H 1 
and with C = span{K, w £ 9 }, Co = JC. Lemma \A . 1 1 shows that p = 7* in this case and moreover that 
the embeddings of translates V a £ 3 follow exactly the pattern exposed by the 7-cascade. Hence we can 
put all the pieces together and get J*. 

Finally, if (G, V G ,g) <J 7l (IC, V K , 6) and (G, V s , q) < 72 (/C, V K , t) are equivalent extensions then the 
unitary intertwiner J* constructed above maps for each element of £ B its first embedding to the second. 
The same happens to elements of £ Hence if we suppress the embeddings we find that Pga |ge is the 
same operator in both cases, i.e., 71 = 72. □ 

Remark: It is instructive to look at the situation described in the previous theorem from the point 
of view of dilation theory. Then we start with a row contraction on f) = g © t of the form 

X 
Y Z 

which is called a lifting (in our application this is pt)V\^^-p). It is well known that in such a situation 
Y must have the form {Dz* )*J*Dx with a contraction 7 : T>z* — > Dx, see 18]. Chapter IV, Lemma 
2.1 for d = 1 and [TH, Prop. 3.1 for the general case. Hence a 7-cascade can also be thought of as a 
description of the structure of the minimal isometric dilation of such a row contraction. Liftings and 
their dilations are studied in lf| 16 1 and the results can be interpreted in the language of processes 
which we use in this paper. 



5. Cascades of Systems 

One of the things one can do with linear systems is to stick them together in various ways. The 
most basic way to do that is to take the output of one system / and to use it as the input of another 
system //. The combined system is then called a simple cascade, see for example [1 91 ] for the classical 
theory (d = 1). It also works for the noncommutative Fornasini-Marchesini systems we have been 
considering here. We need a slight generalization where the input of system 77 is obtained from the 
output of system / by applying a transformation T to it. We call this a T-cascade of systems. 




We assume here that the two noncommutative Fornasini-Marchesini systems both have the same 
multiplicity d. Then the internal space of the combined system is defined to be the direct sum of 
the internal spaces of systems / and II and, with Ujj(a) = Ty/(a) (for all a € F^), it follows, by 
eliminating variables, that the structure maps (A, B, C, D) of the combined system are obtained from 
the structure maps (A 1 , B 1 ,C I , D 1 ) of system I and {A H ,B H , C n , D 11 ) of system 77 by 



A 3 = 



Bf r c 1 





ay 

-)// 



B, 



b] 1 r d 1 



(j = 1, 



,d), 



c = ( d 11 r c 1 c 



D 



d 11 r d 1 



In this case we also speak of a T-cascade of structure maps. 

The transfer function T of such a T-cascade of systems (or of structure maps) factorizes. If 7 is a 
power series with coefficients T a € B(V, W) and V maps W to W then we denote by FT the power 
series with coefficients T7 Q E B(V,W). With this convention it is not difficult to check that the 
transfer function T of the combined system is obtained from the transfer functions T 11 and T 1 of 
systems II and I by 

T(z) = T II (z)TT I (z) 
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(with z a zP = z? a ). 

Now we prove that if (H, V, h) is a 7- cascade (G, V s , g) < 7 (JC, V*, t) of processes then we can think 
of its Fornasini-Marchesini system, from Proposition 13.21 as a cascade of the systems associated to 
(G, V s , g) and (K, V K , t). This is not so obvious if you arrive at the notion of a 7-cascade of processes 
from a dilation point of view but it nicely clarifies the system theory involved and of course it motivates 
our choice of terminology. 

We assume that we have structure maps (AP , , , represented by the process (G,V G ,g) 
and (A*', B K , C K , D K ) represented by the process (fC, V K , t), according to Definition 13.11 
Now we form a 7-cascade of processes 

(g,v s ,q) < 7 Qc,v K -,€) = {n,v^) 

and we introduce output and input spaces represented by (H, V, f)): 

y -=y§, u -.= u£. 

We have the inclusions C £ l C £ and y? C g(B £ B C f) © £ and y G i s wandering for V G and hence 
for V, so indeed this yields a representation of structure maps (A,B,C,D) by the process (H, V, h). 
Further we define 

where the latter equality follows from U§ c £ 8 together with the geometry of the 7-cascade (see 
Definition 14.21 and Lemma f4 . 1 1) . 

A subspace C of % is called a left support of 7 if PcjPgt = "/Pe' and it is called a right support of 7 
if 7?ji = jP £ ePc- With these preparations we can now find a T-cascade of systems in the 7-cascade 
of processes. 

Theorem 5.1. Suppose that U G is a left support and y^ is a right support of 7. Then the structure 
maps (A,B,C,D) are a T-cascade of (A K , B K , C K , D K ) and (A G , B G , C G , D G ). Explicitly (for j = 
l,...,d): 

( A K \ ( B K \ 

J ' = { BfTC* A G J Bj 1, BfTD* ) 

C = ( D G TC K C G ) , D = D G TD K . 

Roughly speaking, U G left and y^ right supporting 7 means that they are chosen big enough to 
transport the information contained in 7. An example is given by the choice U% := £ B and yfr := £| 
and in this case T = 7. 

Proof. We verify the explicit formulas step by step. The arguments are based on the geometry of a 7- 
cascade as given in Definition ^. 21 in particular: V\ g — V G \ g maps g into g®£ and (g®£ 3 )QV G (g^V) 
is orthogonal to the range of V G , further P fl ©£s |jc = Pso |jc = 7^t |/c- 

p 4 ^ - p,v;\ h = ( p ( y;| t p t y;| ) = (Af o ) 
^ = Wln = ( Wit Wl« ) = ( if *| 8 ) 

= (V?*P £s7 P £i \ t V G *\ 5 ) = (V G *P u? P £ z 1 P £l PyK\ t Vf*\ a ) 

= ( B G TC K A G ) 

P e ^ = P t V*\ U K = PiVf*^ = Bf 

P B B 3 = P g V*\ U K = P B V G *P s(B£e \uk = P B V G *P ug P £el P £l P y , \ u k = B G TD K 

C = ( Pyg\ t P yg \ B ) = ( PygPugPs.'YPeiPyfU Py§\ S ) = ( D G TC^ C G ) 

D = p y? k* = Pyg lw* = ^ ^« iPei Py* \u- = D G TD K . 



16 



R. GOHM 



□ 

It is possible to iterate the construction shown in Theorem 15.11 Because V*£S = V s * £® = {0} we 
have £t C^,. Hence if (J-, ' , f) is another process (with the same multiplicity) and 72 : £ * — > £^ is a 
contraction then not only can we form (J 7 , V^, f) < 72 (Q, V g , q) but also (J 7 , , f) <1 72 [((?, y e , g) < 7 
(/C, V*,*)] with 72 : £* -> £ f given by 

* _ f 72 on 

72 -_ \ on £„ 9 £ * B 

Theorem 15. II applies iteratively. This may be seen as another justihcation for calling such a structure 
a cascade. 

6. Observability and Asymptotic Completeness 

To make use of the system theory now available to us for the study of processes we discuss the 
control theory concept of observability in the multi-variable setting. See for example Q for a recent 
treatment in a purely operator theoretic spirit. 

Definition 6.1. Given an output pair (A, C) for an internal space X and an output space y, a subset 
X 1 C X is called observable if (CA a \x>) aeF + , the observability map restricted to X', is injective (as 

a map from X' to the y~ valued functions on F^). 

If (A, C) is represented by the process (H, V, I)) then we also say in this case that X' (c fj) is 
observable in (%, V, fj) by the output space J^o- 

The interpretation of observability is that every £ € X' can be reconstructed from the outputs 
CA a £. 

Proposition 6.2. The subset X' C f) is observable in (H, V, f)) by the output space J^o if and only if 
Py+\x' is injective. 

Proof. For £ £ X 1 we have 

and we conclude that X' is observable if and only if (Py a \x') a£F + is injective. Because the projection 
Py + is the supremum of the projections {Py a ) ae p+ we can replace the family (Py a \x') a& p+ by the 
single contraction Py + \x>. □ 

If we have a 7-cascade and we use the maximal input space £ a of the subprocess as output space 
for the cascade, roughly speaking if we confine our observations to the subprocess, then questions of 
observability become an interesting issue of theoretical and practical importance. 

Theorem 6.3. Consider the 7-cascade 

(g,V s ,g) < 7 (K,V' c ,t) = (H,V,t>) 

and the output space J^o : = £ 3 - The following assertions are equivalent: 

(la) t is observable in (H, V, f)) by 3^o- 

(lb) Py + \t is injective. 

(2a) span{(A a )* q: a € F+} = t) 

(2b) Pfy\g : Q —} i) has dense range. 

(3a) Q = H 

(3b) There is an extended morphism associated to the morphism l g : q — > f) 

which is a unitary from % to %. 
(4a) V K is a row shift and 7 : £\ — > f 8 is injective. 
(4b) F K is a row shift and 7 : £ | — > £ B is isometric. 
If the transition operator Z of ('H, V, f)) is unital then we also have the following equivalent condition: 
(5) lim^oo Z n (P s ) = If, (in the strong operator topology) 
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Proof. The equivalence of (la) and (lb) is Proposition ^. 21 Further 

Py+ t injective 

Pg\f, injective (by adding the space g) 
Pj, | g has dense range (adjoint map) 

which shows (lb) O- (2b). Because G — span{V a Q : a € F~f} assertion (2b) is equivalent to 

f) = span{Pt,V a Q: a e F+} = span{(A a )*g: a e F+} 

which is (2a). On the other hand 

Pfj\g has dense range P-h\q nas dense range 

because H is the closed linear span of the V a t) with a € Ft and we have Q D V a G, hence 

Pv^G d p v ^v a g = v a p„g 

which is dense in V a f). But Q is a closed subspace of T-L and we conclude that (2b) implies (3a). The 
converse, (3a) implies (2b), is obvious. 

It is easy to check that in a 7-cascade the morphism l g , the embedding of g into f), always has lg, 
the embedding of Q into "H, as an associated extended morphism. If we have (3a), i.e. Q — H, then 
lg is nothing but the identity operator 1% on "H, hence (3a) implies (3b). Conversely (3a) is implicit 
in the statement of (3b). 

From Definition 14.21 of the 7-cascade we have 

with £l C K. and (-D 7 .) c JC^, from which we get P/c3^+ C ® Q V a £\- If we have (3a), i.e. Q = H, 
then, because Q — g ® y + and g _L /C, we find P/cy+ = K. and hence K, = @ Q V a £ I which means that 
V K is a row shift. From (3a) we have JC C Q and together with £\ _L g and £ \ _L V^f 3 for all a ^ 
(by Lemma \A. II) we conclude that £\ is actually a subspace of £ s which means that 7 = P^g \ £ e is an 
isometric embedding. Hence (3a) implies (4b). Obviously (4b) implies (4a) and from (4a), together 
with Lemma |4.1[ we get an injective map 

a a a 

and (lb) follows. 

Finally if Z is unital then the projections n (P g ) increase with n and converge to their supremum 
Pg (see Section [2]), hence Z n {P Q ) = Pf ) 9 n (P s )\t ) converges to P^Pg\^ (in the strong operator topology). 
Now (3a) implies that P^Pg]^ = \ and this implies (2b), hence all these assertions, including (5), are 
equivalent. □ 

Remark: Of course (3b) does not mean that (Q,V s ,q) and (H,V,t)) are isomorphic as processes. 
In fact, if g ^ f) then the morphism l g , the embedding of g into f), is not unitary. 

Definition 6.4. The short exact sequence 

@, V g ,g) (H, V, f)) — ^ (JC, V K , t) 

is called asymptotically complete if one (and hence all) of the assertions in Theorem l6.3l are satisfied. 

This terminology is borrowed from scattering theory. It will be explained in the final section of this 
paper how the setting above is related to already existing scattering theories for Markov chains. 

Corollary 6.5. Let an asymptotically complete sequence as in Definition 16.41 be given together with 
an input space Uq and the output space D^o = £ B , both represented by (U, V, fj). Then Oc,A\t = 
(CA a \t) aeF + = (C {A K ) a ) aeF +, the restriction of the observability map to 4, is isometric and the 
transfer function T is inner, i.e., the corresponding multi-analytic operator is isometric. 
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Proof. Note that y = _L g indeed gives us CM Q | { = C(A K ) a for all a. We have V a CA a \ t = Py a \ t . 
The multi-analytic operator corresponding to the transfer function T is Py + \u + ■ 

From asymptotic completeness it follows that g © y+ = Q = H, see Theorem 16.31 (3a). Hence, 
because £ and U+ are orthogonal to g, they are both subspaces of y+ and it follows that Py + |t and 
Py+ \u+ are isometric. □ 

We describe a few alternative ways to interpret asymptotic completeness. First, if we are in an 
asymptotically complete situation and we choose Uq maximal, i.e. Uq = £, then the sum of the 
identity l fl (on g), of the observability map Oc.aW ( on ^) an d of the multi-analytic operator associated 
to T (on U+) is equal to the identity \u- This is a direct consequence of Corollary 16.51 But it is the 
identity \% presented with a change of basis that describes the relative position of the weak nitrations 
of the process {H, V, f)) and of the process (Q, V g , g). 

Second, from the point of view of dilation theory, in the case of asymptotic completeness we deal 
with socalled subisometric dilations. We don't go into this here, see [16j for details. 

Third, asymptotic completeness and its observability aspect can also be given an explicit quantum 
physical interpretation by reconsidering the observables discussed in Proposition 12.61 Indeed, in the 
case of asymptotic completeness the Hilbert space H is the closed span of the spaces g ® 0" V 
(with n e N , the case n = to be interpreted as g). It follows that the algebras B(g(g>®"V) V^ n > 
generate B(H) (as a von Neumann algebra). On the other hand, the observables 

n 

x n = x® §§t v yw* , 
i 

m— 1 n 

Y m , n = i w ® (gjlp^y® (g) t v W B >, m = l,...,n, 

1 m+l 

if we consider all X € B{q) = P g B(t))P s and all Y € B{V), generate B{q ® 0"P) V^* (as a von 
Neumann algebra). Conclusion: We can approximate arbitrary observables in B{%) in the weak (or 
strong) operator topology by observables generated by these X n and Y m ^ n . In particular, in physics 
language, we can answer all questions about observables on the part fi (and hence also find the state) 
from measuring observables on the orthogonal part g of the internal space (the X n ) together with field 
observables (the Y m ^ n ). We have the following quantitative statement about these approximations 
which stresses the role of the observability operator Oc,a- If £ G t then the norm distance squared of 
£ from the space g„ = g ® 0| a | <n V a £ B is given by ||£|| 2 - J2\ a \<n \\ CAa ^\\ 2 - If we nave asymptotic 
completeness then 0c,a|{ is isometric and this distance tends to for n — > oo. 

7. SUBPROCESSES FROM NORMAL INVARIANT STATES 

The following way of finding subprocesses gives a connection to a topic which is of natural interest 
for quantum Markov processes and more general for quantum probability: invariant states. Recall 
that if is a normal state of a von Neumann algebra then in this von Neumann algebra there exists 
a smallest orthogonal projection p such that <p(p) — 1, called the support projection s(<fi). For all 
elements x we have 

</>(x) = <p{s(<p) x) = (j>(x s ((/))) = (j){s(4>) x s{4>)) . 



Essentially the following is Lemma 6.1 from 1J], for convenience we include a proof which uses our 
now familiar terminology and notation. 

Proposition 7.1. Given a process (%, V, f)), suppose that is a normal state of B(t)) which is invariant 
for the transition operator Z, i.e., 

<j>(Z(x)) = (f>(x) for all x e . 
Then with P B := s(<p) the subspace g is co-invariant for V. 
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Proof. Let £ := f) 9 fj. Then 

= 0(P t ) = 0(Z(P t )) = 4>(P a Z(P t )P g ), 

hence P s Z{P t )P s — (because restricted to the subalgebra s{(f>) <B(f)) s(<f>), obtained by compression 
with its support, is a faithful state). But 

P 3 Z(Pt)P = P S V(P { ® 1 P )V*P B = X*X 

with X = (P{ <g) Hp)V*P B . Hence X = which shows that g is co-invariant for V. □ 

Using Proposition 17.11 we can always find a subprocess from a normal invariant state and this 
subprocess is nontrivial, in the sense that g ^ f), if and only if the state is not faithful. Definition 
16.41 for asymptotic completeness applies and we can use the criteria in Theorem 16.31 Instead of 
further analyzing the general case we concentrate for the rest of this section on an application to 
a class of noncommutative Markov processes which are not originally weak Markov processes but 
which nevertheless can be studied successfully by our methods. 

The following construction represents the most basic way of introducing stationary Markov chains 
in an operator algebraic context. In this form it is taken from see more details there. A more 
leisurely introduction to the topic is [HI]. Let A and C be C*-algebras and let 

j : A A ® C 

be a non-zero *-homomorphism. This can be iterated to yield the *-homomorphisms 

n 

]n ■ A -> A <g> C , 
1 

where ji(a) := j(a) —: J2k a k® € A ® C and then inductively 

n— 1 n 

j n (a) :=^2jn-i( a k)®c k € {A®(g)C)<S>C = A®(g)C. 

k 1 1 

If (j) resp. tp are states on A respectively C and we impose the stationarity condition 

[<j> ® ip) o j = (j> 

then we can think of the sequence (j n ) as noncommutative random variables which form a noncom- 
mutative stationary Markov chain. 

We can obtain an associated weak process in the sense of this paper by applying the GNS-construction 
which from (A,4>) produces (f),^) and from (C,ip) produces (V,^), the GNS-Hilbert space and a 
cyclic vector representing the state (in each case). Then the stationarity condition translates into the 
fact that 

(with a iE A) is an isometry. In Chapter 1 of [2l| v\ is called the associated isometry and it plays a 
central role there in the analysis of the Markov chain. We are now in a position to deepen this analysis 
and to make the conceptual framework more elegant by putting it into the context provided in this 
paper. We start with the adjoint 

v\ : t) ® V -> 1} 

which is a (row) coisometry. Hence its minimal isometric dilation 

V :H®V^H 

is a row unitary and {1-L, V, f)) is a unital process. (It is a well known fact in dilation theory which 
can be checked directly that the minimal isometric dilation V on % of a row contraction on () is a 
row unitary if and only if the row contraction is coisometric.) We call (H, V, (]) the (weak) process 
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associated to the original stationary Markov chain. 

Remark: If we think of V* as a kind of (Schrodinger) dynamics of vector states for a weak pro- 
cess then we see that for an associated weak process this is provided on f) by the associated isometry 
v\ . Note however that if we consider the original noncommutative random variables j n as steps of a 
(Heisenberg) dynamics of observables then there is a time reversal involved if we go to the noncom- 
mutative random variables j( n ' (as in Section [2]) as steps of a (Heisenberg) dynamics of observables 
for the associated weak process. 

But there is more structure available from a stationary Markov chain. One can check that the 
vector state represented by f2^ is invariant for the transition operator Z of the (weak) process and 
hence we can invoke Proposition 17.11 to see that its support, the one-dimensional subspace g — Cf^, 
is co-invariant for V. In fact it is easy to check this directly: Consider the one-dimensional subspace 
g := Ctlcf, C f). Because V*\t, = V\ and V\Q,^ = VL^ ® O.^ we find 

V*g = V*CQ„> = Cn^ ®^cg®P 

which implies that g is co-invariant for V and hence we have a subprocess (Q, V , g). Combined with 
the analysis in Section 2] we obtain the surprising result that the GNS-Hilbcrt space of a stationary 
Markov chain always carries automatically the structure of a 7-cascade 

(H,V,V = (G,V G ,g) < 7 {K,V K ,t) 

and we can think of the whole short exact sequence 

-> (G, V g ,g)h (G, V g ,g) < 7 (K, V K ,t) A (£, V K , t) -> . 

as being associated to the stationary Markov chain. We have g = Cfi^ and t — Aflfj, CO^. 

Let us investigate what asymptotic completeness means in this case. We can define the associated 
isometry v n for the n-th noncommutative random variable j n which, because it arises from the same 
iterative procedure, can be expressed by the iteration of V. With a € A and £ := aCl$ £ f) we 
have 

n 

in (a) ^ ® (g) = v n i = VW'Z 
1 

and from that 

n 

1 

which yields the norm equality 

Because Py(n) g ®®™p increases to the projection Pg (which is the limit for n — > 00 in the strong 
operator topology) it follows that the property G = H, equivalent to asymptotic completeness by 
Theorem 16. 31 is also equivalent to 

ll-Pfl®®?^ v nC\\ U\\ for n-t 00 
for all £ £ f). If the GNS-representation is faithful then we can interpret the Hilbert space norm as a 
norm || • H2 on the C*-algebra and we can write 

II^U®<g>?e 3n{a)h ~> IHI2 for n -> 00 
for all a £ .4, where Pi^jg^e denotes the conditional expectation obtained by evaluating the state (/> 
on A 

However the latter condition is well known to be equivalent to the asymptotic completeness of 
the stationary Markov chain in a scattering theory meaning introduced by Kummerer and Maasscn 
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in [301 ] - As mentioned earlier here we followed a variant developed in [20J. The equivalence of the 
convergence in the norm || • H2 above with the property of asymptotic completeness in the scattering 
theory meaning is stated in Lemma 1.5 of [20j. Hence, proceeding as above, all these meanings of 
asymptotic completeness coincide. Let us summarize this insight as follows: 

Proposition 7.2. The short exact sequence of weak processes associated to a stationary Markov chain 
is asymptotically complete in the sense of Definition 16.41 if and only if the stationary Markov chain is 
asymptotically complete in the scattering theory meaning of [3(| or [2(|. A necessary and sufficient 
criterion is given by 

lim Z n (P„) = If, (in the strong operator topology) 

n— »oo 

In fact, the last statement is nothing but criterion (5) for asymptotic completeness from Theorem 
16.31 applied to this special situation. This reproduces a criterion for asymptotic completeness in terms 
of the completely positive map Z (which in [20] is called the dual extended transition operator Z' ', 
see jib] ]. Section 4, in particular Theorem 4.3). Because in this case g — Cf2^ is one-dimensional the 
criterion is also equivalent to the ergodicity of Z 1 i.e. the fixed point set of Z being equal to Of,, see 
[20l |. Section 3, for more details. 

The construction of the scattering theory itself with its intertwiners (M0ller operators) between a 
free and a perturbed dynamics requires a two-sided process (i.e. with time variable in Z for discrete 
time) as in [3(| or the construction of a two-sided extension as in and it would take us too far to 
include it here. The reader who follows the given sources will recognize that the change of basis studied 
in Section [6] relating the weak filtration of ("H,V,f)) to the weak filtration of (G,V ^, g) and given by 
the identity on the one-dimensional g = Ctt,p plus the observability map from t to y+ phis the multi- 
analytic operator associated to the transfer function (on U+), is a GNS-version of the M0ller operator 
in the scattering theory, in the same way as the associated weak Markov process is a GNS-version of 
the stationary Markov chain. If we take into account the identification of Q with the inductive limit of 
the sequence (g^^iV) mentioned in Section[5]and note that because g — CO^ is one-dimensional we 
can identify it further with an infinite tensor product then we arrive essentially at the version 

of the scattering theory worked out in [21], Chapter 2. 

But the setting of this paper is more general. For example we could also consider stationary Markov 
chains which are not originally constructed by tensor products of algebras and nevertheless associate 
weak processes via GNS-construction and apply Proposition 17.11 to find subprocesses (with g = Cf^). 
Or we can study higher-dimensional co- invariant subspaces g. The systematic use of weak processes 
adds conceptual clarity and cohesion to such investigations. 
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